Browsing and Matching − Scoping Status: Final Report Overview of the Current Situation: Matching the Multilingual Approach in Today's Search Engines Universal Character Set

نویسنده

  • Marc Wilhelm Küster
چکیده

1 Executive summary Today's society is on its way from a traditionally production-based economy to a knowledge-based economy. The European Commission's action plan on Europe's way to the information society 1 outlines some of the major developments in this field and recommends steps to be undertaken to prepare Europe for this challenge. Obviously, the Information Society is not only about information, not even only about access to information, it is also about locating relevant information. In many ways, information retrieval is the Web revolution's neglected child. Even the otherwise excellent Information Society Glossary 2 does not refer to this crucial topic. Of course, search engines, portal sites, and indexing services do exist. However, in contrast to many of the other topics in this field, the question of locating information involves not only international standards, but also specifically European, national, regional, social, and even personal factors. Many of these issues are related to Europe's multilingual and multicultural heritage which European institutions, including standards bodies such as CEN/TC304 »European localization requirements«, must strive to protect. The issues encompass points such as: − Existence of relevant information in many languages; − The use of different scripts (e. g. Latin, Greek, and Cyrillic scripts); − The use of letters which are particular to a given language or a number of languages; − Expectations how such letters or scripts are handled in more restricted character sets such as ASCII (fallback, transliteration, input methods); − Familiarity with certain cataloguing schemes / database categories specific to a country / a group of countries. The task soon becomes more ambitious. Human readers 3 will naturally recognize that sing, sang, sung 4 are just three tenses of the very same verb, just as oeil and yeux differ only with respect to number. They will also not mix the German word Boot with its English homograph of completely different meaning, 5 whereas they understand at once that Pericles, Perikles and Περικλη Ä ς are really one and the same person 6 and that browsing and scanning can be synonyms 7 in some contexts but not in others. 8 For English, with its fairly limited number of irregular verbs and its otherwise rather regular construction of derived forms, some of these problems can still be dealt with relatively easily in comparison with most other European languages where word formation is more complex. While no speedy solution is to be expected, these …

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Correctional and education center in Iran: analysis of the current situation and design of the desired situation

Introduction: Children and adolescents build the future of a country. When they commit a crime based on circumstances and are kept in a correctional center, their age needs should be taken into account in order to prevent them from committing a crime again. The purpose of the current research is to identify the most important problems and challenges in the management and treatment of children a...

متن کامل

Correctional and education center in Iran: analysis of the current situation and design of the desired situation

Introduction: Children and adolescents build the future of a country. When they commit a crime based on circumstances and are kept in a correctional center, their age needs should be taken into account in order to prevent them from committing a crime again. The purpose of the current research is to identify the most important problems and challenges in the management and treatment of children a...

متن کامل

Browsing and Matching − scoping

1 Executive summary Today's society is on its way from a traditionally production-based economy to a knowledge-based economy. The process cannot be stopped. The European Commission's action plan on Europe's way to the information society 1 outlines some of the major developments in this field and recommends steps to be undertaken to prepare Europe for this challenge. Obviously, the Information ...

متن کامل

Fractured Reservoirs History Matching based on Proxy Model and Intelligent Optimization Algorithms

   In this paper, a new robust approach based on Least Square Support Vector Machine (LSSVM) as a proxy model is used for an automatic fractured reservoir history matching. The proxy model is made to model the history match objective function (mismatch values) based on the history data of the field. This model is then used to minimize the objective function through Particle Swarm Optimization (...

متن کامل

Internet searching and browsing in a multilingual world: An experiment on the Chinese Business Intelligence Portal (CBizPort)

The rapid growth of the non-English-speaking Internet population has created a need for better searching and browsing capabilities in languages other than English. However, existing search engines may not serve the needs of many non-English-speaking Internet users. In this paper, we propose a generic and integrated approach to searching and browsing the Internet in a multilingual world. Based o...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001